|
|
Accession Number |
TCMCG075C20352 |
gbkey |
CDS |
Protein Id |
XP_007025631.1 |
Location |
join(23853473..23853718,23854399..23854620,23854708..23854998,23855120..23855317,23855618..23855682,23855779..23855865,23855940..23856060,23856154..23856198,23856294..23856431,23856586..23856658,23856829..23856905,23857003..23857110,23857195..23857311,23857564..23857719,23857801..23857905,23858032..23858157,23858249..23858326,23858411..23858506,23858599..23858714,23858838..23858907,23859308..23859361,23859456..23859599,23859745..23860044,23860341..23860763,23860879..23860992) |
Gene |
LOC18596855 |
GeneID |
18596855 |
Organism |
Theobroma cacao |
|
|
Length |
1189aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007025569.2
|
Definition |
PREDICTED: DNA-directed RNA polymerase II subunit RPB2 [Theobroma cacao] |
CDS: ATGGAGGACGACAGTGAGTACGATCCGCAACTTATGGACGACGAAGACGACGAGGAGATCACGCAGGAAGACGCGTGGGCGGTTATCTCAGCTTACTTCGAAGAAAAAGGTCTGGTGCGTCAACAGCTCGACTCGTTCGATGAATTTATCCAAAACACTATGCAAGAAATCGTCGACGAATCGGCCGATATTGAGATCAGGCCAGAGTCACAGCACAATCCTGGTCACCAGTCCGACTTTGCTGAGACTATCTATAAGATTAGCTTTGGTCAGATCTACCTTAGTAAACCTATGATGACCGAGTCAGATGGTGAAACTGCAACTTTATTTCCAAAAGCTGCAAGGTTGAGGAATCTTACTTACTCAGCTCCATTGTATGTCGATGTAACTAAGAGAGTTATAAAGAAAGGGCATGATGGTGAAGAAGTCACTGAGACTCAGGATTTTACTAAAGTGTTCATTGGGAAGGTTCCTATAATGCTCCGGTCAAGTTATTGCACACTATATCAAAATTCAGAGAAGGATCTGACCGAGCTTGGGGAGTGTCCATATGATCAAGGTGGGTATTTCATTATCAATGGGAGTGAAAAGGTTCTAATTGCTCAGGAGAAGATGAGCACAAATCATGTCTATGTCTTCAAAAAGAGGCAGCCGAACAAATATGCCTATGTGGCAGAAGTTCGGTCCATGGCAGAGTCCCAGAATAGGCCACCAAGTACCATGTTTGTGCGGATGCTTTCTCGGACTAGTGCCAAAGGGGGCTCTTCGGGGCAGTACATTCGTGCTACTCTTCCATATATTCGGACTGAAATTCCTATCATAATTGTCTTTCGGGCTTTGGGATTTGTTGCTGACAAGGACATATTAGAGCATATATGCTATGACTTCTCCGACACCCAGATGATGGAGTTGCTTAGGCCTTCCTTAGAAGAAGCATTTGTGATTCAAAACCAGCAGGTTGCACTAGATTATATTGGTAAAAGAGGAGCAACTGTTGGTGTTACCAGAGAAAAGAGGATTAAGTATGCTAAAGAGATCCTCCAAAAAGAAATGCTTCCTCACGTAGGTGTTGGAGATTTTTGCGAGACAAAGAAAGCTTATTATTTTGGATATATTATTCACCGGCTGCTTCTTTGTGCACTTGGCCGGAGGGCGGAAGATGATAGAGATCATTATGGCAACAAGAGGTTGGACCTTGCTGGTCCATTACTTGGAGGCCTCTTTAGAATGCTTTTTCGGAAGTTAACTAGGGATGTGAGATCTTATGTGCAGAAGTGTGTTGATAACGGGAAGGATGTGAACCTGCAATTTGCTATCAAAGCGAAAACTATTACAAGTGGTCTTAAATACTCACTTGCTACTGGAAATTGGGGGCAAGCAAATGCAGCTGGTACTAGAGCTGGAGTGTCACAGGTGTTAAACCGTTTGACATATGCCTCAACTTTGTCACACTTGCGAAGGCTCAATTCTCCTATAGGACGTGAAGGGAAATTGGCTAAACCACGTCAGTTGCATAATTCACAGTGGGGAATGATGTGTCCAGCGGAAACACCGGAAGGACAGGCCTGTGGACTTGTAAAGAATCTTGCCTTGATGGTATACATAACTGTCGGATCAGCTGCATATCCTATTCTTGAATTTTTGGAAGAGTGGGGTACGGAGAATTTTGAGGAAATCTCACCTGCAGTTATCCCTCAAGCTACAAAAATTTTTGTCAATGGTTGCTGGGTTGGTGTACATCGGAATCCTGATATGCTTGTGACAACATTGAGACGGTTGAGAAGACGGGTTGATGTCAATACTGAAGTTGGTGTTGTTAGAGATATCCGTCTAAAAGAACTTCGAATATATACTGACTATGGTCGTTGCAGTCGACCATTGTTCATCGTGGAGAAACAAAGACTTCTCATAAAGAAGAAAGATATTCATGCACTGCAACAAAGAGAAAGCCCAGAAGACGGTGGCTGGCATGATCTTGTAGCAAAGGGATTTATAGAATACATTGACACGGAAGAAGAGGAGACAACAATGATTTCCATGACCATCAATGATCTTGTACAAGCGAGAGTCAATCCAGAGGAAGCTTATTCTGAAACTTATACCCATTGTGAGATCCACCCTTCATTGATTTTGGGTGTTTGTGCTTCAATTATACCATTTCCTGATCATAATCAGTCCCCGCGTAATACCTATCAATCTGCTATGGGTAAGCAAGCAATGGGAATATATGTTACCAACTACCAATTTCGAATGGATACATTGGCCTATGTTCTCTATTATCCCCAAAAGCCACTTGTTACTACACGAGCTATGGAACATCTCCACTTTCGGCAGCTTCCAGCTGGCATTAATGCTATTGTTGCTATCGCCTGCTATTCTGGATATAACCAAGAAGATTCTGTTATTATGAATCAATCATCAATAGACCGTGGATTCTTCCGATCACTTTTCTTCCGCTCTTACCGAGATGAGGAGAAAAAAATGGGGACCCTTGTTAAAGAAGATTTTGGTCGACCAGATAGGGCTAATACTATGGGAATGAGGCATGGCTCTTATGATAAATTGGATGATGATGGTCTTGCACCTCCTGGAACAAGAGTTTCAGGTGAGGATGTAATCATCGGAAAGACCACCCCGATTTCTCAGGAAGAAGCTCAGGGACAAGCATCACGCTATTCAAGACGTGATCATAGCATAAGCTTACGTCACAGTGAAACAGGCATAGTGGACCAAGTTCTATTGACAACTAATGCTGATGGGTTGAGATTTGTGAAAGTAAGGGTAAGATCTGTTCGCATTCCCCAGATTGGGGACAAGTTTAGCAGTAGACATGGTCAAAAGGGGACAGTGGGCATGACATACACGCAGGAAGACATGCCTTGGACTGTGGAAGGCATCACACCCGATATCATTGTGAACCCACATGCTATTCCTTCTCGAATGACAATTGGTCAGCTTATTGAATGTATCATGGGGAAAGTTGCAGCTCACATGGGCAAGGAAGGGGATGCCACTCCTTTTACAGATGTCACCGTGGACAATATCAGCAGAGCTCTTCATAAATGTGGATATCAAATGCGTGGTTTTGAGACCATGTATAATGGGCACACAGGCAGGCGCCTTTCTGCTATGATATTTTTGGGGCCCACATATTACCAAAGACTAAAGCACATGGTTGATGATAAGATCCATTCTCGTGGTCGGGGCCCTGTGCAGATCCTGACAAGGCAGCCTGCAGAGGGACGATCCCGTGATGGTGGTCTCCGTTTCGGAGAGATGGAAAGAGATTGCATGATTGCGCATGGTGCTGCTCATTTCCTTAAAGAGAGATTGTTTGACCAAAGTGATGCATACAGGGTCCATGTGTGCGAGCGTTGTGGGTTGATTGCTATTGCAAATCTAAAGAAGAACTCATTTGAGTGCAGAGGATGCAAGAATAAAACTGATATTGTTCAGGTATACATTCCTTACGCCTGTAAGCTGCTCTTCCAAGAGCTTATGGCCATGGCAATTGCTCCAAGAATGCTCACAAAGGAACCTCCCAAAGACCAAAAGAAGAAAGGAGCCTGA |
Protein: MEDDSEYDPQLMDDEDDEEITQEDAWAVISAYFEEKGLVRQQLDSFDEFIQNTMQEIVDESADIEIRPESQHNPGHQSDFAETIYKISFGQIYLSKPMMTESDGETATLFPKAARLRNLTYSAPLYVDVTKRVIKKGHDGEEVTETQDFTKVFIGKVPIMLRSSYCTLYQNSEKDLTELGECPYDQGGYFIINGSEKVLIAQEKMSTNHVYVFKKRQPNKYAYVAEVRSMAESQNRPPSTMFVRMLSRTSAKGGSSGQYIRATLPYIRTEIPIIIVFRALGFVADKDILEHICYDFSDTQMMELLRPSLEEAFVIQNQQVALDYIGKRGATVGVTREKRIKYAKEILQKEMLPHVGVGDFCETKKAYYFGYIIHRLLLCALGRRAEDDRDHYGNKRLDLAGPLLGGLFRMLFRKLTRDVRSYVQKCVDNGKDVNLQFAIKAKTITSGLKYSLATGNWGQANAAGTRAGVSQVLNRLTYASTLSHLRRLNSPIGREGKLAKPRQLHNSQWGMMCPAETPEGQACGLVKNLALMVYITVGSAAYPILEFLEEWGTENFEEISPAVIPQATKIFVNGCWVGVHRNPDMLVTTLRRLRRRVDVNTEVGVVRDIRLKELRIYTDYGRCSRPLFIVEKQRLLIKKKDIHALQQRESPEDGGWHDLVAKGFIEYIDTEEEETTMISMTINDLVQARVNPEEAYSETYTHCEIHPSLILGVCASIIPFPDHNQSPRNTYQSAMGKQAMGIYVTNYQFRMDTLAYVLYYPQKPLVTTRAMEHLHFRQLPAGINAIVAIACYSGYNQEDSVIMNQSSIDRGFFRSLFFRSYRDEEKKMGTLVKEDFGRPDRANTMGMRHGSYDKLDDDGLAPPGTRVSGEDVIIGKTTPISQEEAQGQASRYSRRDHSISLRHSETGIVDQVLLTTNADGLRFVKVRVRSVRIPQIGDKFSSRHGQKGTVGMTYTQEDMPWTVEGITPDIIVNPHAIPSRMTIGQLIECIMGKVAAHMGKEGDATPFTDVTVDNISRALHKCGYQMRGFETMYNGHTGRRLSAMIFLGPTYYQRLKHMVDDKIHSRGRGPVQILTRQPAEGRSRDGGLRFGEMERDCMIAHGAAHFLKERLFDQSDAYRVHVCERCGLIAIANLKKNSFECRGCKNKTDIVQVYIPYACKLLFQELMAMAIAPRMLTKEPPKDQKKKGA |